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Not So Distant Future 


І suspect that the following 
problem A in computational 

geometry is іп P ..., 
what do you think? 
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Not So Distant Future 


Cluster of 10k CPUs is 
searching and reasoning 
over a knowledge base 
of 1M definitions, 20M 
theorems and proofs 
and 100B lemmas... 
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Not So Distant Future 


Indeed, it is similar to a less known problem B 
number 13501 in my knowledge base. We can use 
a similar polynomial reduction to planar graphs 
as іп В, and for the resulting constraint-solving 
problem we use a modified version У of the 
O(n?) algorithm X published last year in Proc. of 
Indian Conf. on Graph Theory. 
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Not So Distant Future 


Here is my verified formal proof with 
100k basic inference steps. Here are two 
high-level versions of the proof, one for 
experts and one for textbooks. 
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Not So Distant Future 


Btw., A, В and X, Y generalize to a far-reaching 
conjecture that could solve a long-standing open 
problem. 


Let's write an ERC proposal about exploring them! 
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How Distant? 


e 15-50 years, depending on our efforts 

e Today’s numbers about 100x smaller: 
e 10k-30k computer-understandable definitions 
e 200k-300k (small) theorems and proofs 
e 1B-10B primitive lemmas 


e Covers roughly the Bc level in Math/CS, PhD level still far 
e The main bottleneck: 


WEAK AUTOMATION OF REASONING 
OVER LARGE COMPUTER-UNDERSTANDABLE CORPORA 


e This is where a breakthrough is necessary 
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AI4REASON Goals 


e Breakthrough in a hard problem in Al and reasoning: 
automatically proving theorems in complex theories 


e Produce Al systems that combine learning and reasoning 
e Thus help with automating verification of: 
e advanced mathematics and big proofs (Kepler conjecture) 


e software and hardware designs (seL4 OS microkernel) 
e advanced systems and designs (finance, industry, science) 
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Example: The Kepler conjecture 


J. Kepler (1611, Prague): The most compact way of 
stacking balls of the same size in space is a pyramid. 


V . „Ф 


BEVE 


Big proof: 300 pages + computations (Hales, Fergusson, 1998) 
Formal proof finished in 2014, 20000 theorems & proofs 

All of it computer-understandable and verified in HOL Light: 
polyhedron s /\ c face_of s ==> polyhedron c 
However, this took 20 — 30 person-years! 

Our Al methods can fully automate 40% of the proofs (2014) 
Similar verification efforts for bug-free compilers, OS, etc. 
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Sample of Formal Math: Irrationality of V2 


itheorem sqrt2_not_rational: 
' "sqrt (real 2) € Q" 

proof 

H assume “sqrt (real 2) є Q" 


then obtain m n :: nat where 
n_nonzero: "п # О" and sqrt_rat: "|<аг+ (real 2)! = real m / real n" 
апа lowest_terms: "оса m n = 1" .. 
from n_nonzero and sqrt_rat have "real m = įsqrt (real 2)! * real n" by simp 
then have "real (m?) = (sqrt (real 2))? * real (n?)" 
by (auto simp add: power2_eq square) 
also have "(sqrt (real 2))? = real 2" by simp 
also have "... * real (m?) = real (2 * пт)” by simp 
finally have eq: "m? = 2 * n2" .. 
hence "2 dvd m?" . 
with мо іѕ prime have ауа т: "2 dvd т" by (rule ргіте ауа ромег мо) 
then obtain К where "т = 2 * k" 


with eq have "2 * п? = 2° * k?" by (auto simp add: ромег2 еда square ти1+ ас) 
hence "n? = 2 * k?" by simp 

hence "2 dvd п2" va 

with two_is_ prime have "2 dvd п" by (rule prime_dvd_power_two) 

with ауа т have "2 dvd gcd m п" by (rule gcd greatest) 

with lLowest_terms have "2 dvd 1" by simp 

thus False by arith 


let SQRT_2_ IRRATIONAL = prove 
(“rational (sqrt (&2))`, 
SIMP_TAC[rational; real_abs; SQRT_POS_LE; REAL_POS] THEN 
REWRITE_TAC[NOT_EXISTS_THM] THEN REPEAT СЕМ ТАС THEN 
DISCH_THEN (CONJUNCTS_THEN2 ASSUME_TAC MP_TAC) THEN 
SUBGOAL_THEN *~((&p / &q) pow 2 = sqrt(&2) pow 2) * 
(fun th -> MESON_TAC[th]) THEN 
SIMP_TAC[SQRT_POW_2; REAL_POS; REAL_POW_DIV] THEN 
ASM_SIMP_TAC[REAL_EQ_LDIV_EQ; REAL_OF_NUM_LT; REAL_POW_LT; 
ARITH_RULE `0 < q <=> ~(q = 0) `] THEN 
ASM_MESON_TAC[NSQRT_2; REAL_OF_NUM_POW; 
REAL_OF_NUM_MUL; REAL_OF_NUM_EQ]);; 
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The AI4REASON Plan of Attack 


WP! Al for finding relevant knowledge in large formal corpora: 

e How to capture similarity and analogy of ideas? 

e How to learn from proofs, counter-examples and theories? 
WP2 Al-based guiding methods for reasoning tools: 

e How to efficiently apply the learned guidance? 

e How to automatically learn the best reasoning strategies? 
WP3 Al for suggesting plausible conjectures and concepts: 

e What makes a good conjecture for a given problem? 

e What concepts are good for a given problem? 
WP4 Self-improving Al interleaving learning and deduction: 

e How to explore easier problems to learn for harder ones? 

e How to develop theories and gain most useful knowledge? 
WP5 Deployment and Cross-Corpora Reuse: 


e Deploy the methods as strong online services 
e Translate informal math to formal 
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Combining Learning and Theorem Proving 


high-level: select relevant lemmas from a large library 
high-level: select good high-level strategies for a problem 
low-level: guide all inference steps of theorem provers 
mid-level: guide application of tactics to goals 

mid-level: invent suitable strategies for problem classes 
mid-level: invent suitable conjectures for a problem 
mid-level: invent suitable concepts/models for problems 
proof sketches: explore related theories to get proof ideas 
theory exploration: develop new theories by conjecturing 
feedback loops: (dis)prove, learn from it, (dis)prove more 
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Some Highlights So Far 


Won two divisions of the 2018 proving competition (CASC) 
2017/18: Improved the best open prover by ML guidance 


2018: 40% improvement of the leanCoP prover by 
reinforcement learning 


2017-18: Tacticloe — first ML-guided tactical system 
2015-18: Blind Strategymaker - invent proving strategies 
First deep-learning based provers (with Google Research) 


2015-18: Inf2formal — Translating informal math to formal, 
using grammar-based/semantic and neural systems 


Invited talks — Fields Inst., TYPES’18, Hales’60, AGI'18 
2016 Google Research Award for JU 

AITP conference series started: aitp—-conference.org 
Al/TP group at Google Research (2016), OpenAl - 2018? 
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Team and Collaborations 


Chad Brown, Jan Jakubův, Martin Suda, Thibault Gauthier, 
Bartosz Piotrowski, Zarathustra Goertzel, Shawn Wang 
External scientific advisors 
e Prof. Stephan Schulz (Autom. reasoning, DHBW Stuttgart) 
e Prof. Robert Veroff (Autom. reasoning, U. of New Mexico) 
e Prof. Tom Heskes (Al, Radboud U. Nijmegen) 
Further Collaborations 
e Dr. Cezary Kaliszyk, U. of Innsbruck (ERC in 2016) 
Dr. Jasmin Blanchette, VU Amsterdam (ERC in 2016) 
Prof. Larry Paulson, U. of Cambridge (ERC in 2017) 
Prof. Geoff Sutcliffe, U. of Miami 
Dr. Christian Szegedy, Google Research 
Prof. Herman Geuvers, Radboud U. Nijmegen 


over 20 research visits so far 
large related national grant awarded to JU in 2017 
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Future Potential - Science 


e Use strong Al/reasoning and formal verification for: 
e Science 
e Routinely verify complex math, software, hardware? 
e Make all of math/science computer-understandable? 
e Strong Al assistants for math/science? 
e Examples 
e Automatically understand/verify/explain all arXiv papers? 
• Can we train a superhuman system like AlphaGo/Zero for 
math/physics? What will it take? 
e Can we prove that the Amazon Cloud cannot be hacked? 
e The same for critical government/private IT systems? 
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Future Potential - Society 


e Use strong Al/reasoning and formal verification for: 

e Society 

Leibniz’s dream: Let us Calculate! (solve any dispute) 
J. McCarthy: Mathem. Objectivity and the Power of Initiative 
Al/reasoning assistants for law/regulations 

Verification of financial, transport/traffic systems, ... 
Explainable and very securely verified systems 

e Examples 

Prove that two Paris metro trains will never crash? 
Prove that a trading system doesn’t violate regulations? 
Prove that a new law is inconsistent with an old one? 
Automatically debunk fallacies in political campaigns? 
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Possible Pitfalls and Avoiding Them 


Keep informed, don’t fall for the hype 


Al is much more than just (deep) learning/neural nets 


E.g., SAT/SMT/model-checking may be one of the biggest 
recent Al successes — Amazon, Facebook, Microsoft, etc. 


Don’t expect miracles/singularity due to the current hype 
We can train image recognition & language models, but ... 
.. don’t know what it takes to solve hard science problems 
However, some breakthroughs can happen quickly 
Researchers/society/lawmakers need to talk more/faster 
Al infrastructure for EU (CLAIRE) could serve this purpose 
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Possible Pitfalls and Avoiding Them 


Don't let US, China, ... 


... take away the best EU science minds 
In reasoning and formal methods EU is the leader! 


Make a deal with big Al companies to seriously support 
open university-based research 


Example: PRAIRIE institute in Paris, 

... CLAIRE centers modelled after that? 

Infrastructure like CLAIRE very needed in countries like CR 
Larger brain-drain and local incompetence aggravating it 
Use such infrastructure to impose EU values on Al 
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Links and Impacts on Other AI Areas 


Main areas: Machine Learning, Automated Reasoning 
Needs advances in Representation Learning 

Al needs intuition, but also reasoning and explanations 
Impact on Formal Verification (SW, HW, etc.) 
Potentially on any (hard) science/thinking/arguing 

Alan Turing, 1950, Al: 


“We may hope that machines will eventually compete with 
men in all purely intellectual fields.” 
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Outlook - Bets from 2014 


• In 20 years, 80% of Flyspeck and Mizar toplevel theorems 
will be provable automatically (about 40% in 2014) 

e The same in 30 years - lII give you 2:1, In 10 years: 60% 
(getting there) 

e In 25 years, 50% of the toplevel statements іп ATEX-written 
Msc-level math curriculum textbooks will be parsed 
automatically and with correct formal semantics 
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Outlook — Scientific Revolution 


(from a talk about Kepler and Hales) 

What did Kepler, Galileo & Co start to do in 1600s? 

What are we trying to do today? 

Kepler’s Conjecture in Strena in 1611 (with many others) 
Kepler's laws, Newton, ..., age of science, math, machines 
..., Hilbert, ..., Turing, ... age of computing machines? 
1998 machine helps to find a proof of Kepler’s Conjecture 
2014 machine verifies a proof of Kepler’s Conjecture 

... 2050? machine finds a proof of Keplers Conjecture? 
(no betting ;-) 
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Thanks and Advertisement 


Thanks for your attention! 
AITP — Artificial Intelligence and Theorem Proving 
April 7-12, 2019, Obergurgl, Austria, 


aitp-conference.org 


ATP/ITP/Math vs Al/Machine-Learning people, 
Computational linguists 


Discussion-oriented and experimental 
Grown to 60 people in 2018 
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